Real Time Snapshot Collection Algorithm for Mobile Distributed Systems with Minimum Number of Checkpoints
Abstract
Checkpointing is an efficient way of implementing fault tolerance in distributed systems. Mobile computing raises many new issues, such as high mobility, lack of stable storage on mobile hosts (MHs), low bandwidth of wireless channels, limited battery life, and disconnections, that make traditional checkpointing protocols unsuitable for such systems. Minimum-process non-blocking coordinated checkpointing may be useful for mobile distributed systems because this approach is domino-free, requires at most two checkpoints of each process on stable storage, forces only interacting processes to checkpoint, and does not suspend their underlying computation during checkpointing. However, it sometimes also requires piggybacking of information onto normal messages, blocking of the underlying computation, or taking some useless checkpoints. In this paper, we propose a non-blocking minimum-process coordinated checkpointing algorithm that requires minimum bandwidth over wireless channels, does not require any induced/forced or mutable checkpoints, and reduces the height of the checkpointing tree without incurring any extra overhead in real time.
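The phrase "forces only interacting processes to checkpoint" can be illustrated with a small sketch. This is not the algorithm proposed in the paper, whose details the abstract does not give. It only shows the general minimum-process idea under an assumed model: each process records the set of processes from which it has received a message since its last checkpoint, and the initiator's checkpoint request spreads along those dependencies. The function name `minimum_checkpoint_set` and the `depends_on` input format are hypothetical.

```python
from collections import deque


def minimum_checkpoint_set(initiator, depends_on):
    """Return the set of processes that must checkpoint with `initiator`.

    Assumption (hypothetical input format): depends_on[p] is the set of
    processes from which p has received at least one message since p's
    last checkpoint. If p checkpoints, each such sender must checkpoint
    too, so that no recorded receive lacks a recorded send.
    """
    must_checkpoint = {initiator}
    queue = deque([initiator])
    while queue:
        p = queue.popleft()
        # Follow direct dependencies; the loop yields the transitive closure.
        for q in depends_on.get(p, ()):
            if q not in must_checkpoint:
                must_checkpoint.add(q)
                queue.append(q)
    return must_checkpoint


# Illustrative data only: five processes, process 0 initiates.
depends_on = {
    0: {1},    # P0 received a message from P1
    1: {2},    # P1 received a message from P2
    2: set(),  # P2 received nothing
    3: {4},    # P3 received a message from P4
    4: {0},    # P4 received a message from P0
}
result = minimum_checkpoint_set(0, depends_on)
print(sorted(result))
```

With this sample data only processes 0, 1, and 2 are asked to checkpoint. Processes 3 and 4 are left alone: P4 received a message from P0, but P0's new checkpoint recording that send while P4's old checkpoint lacks the receive leaves the message in transit, not orphaned.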
Similar resources
Low-Cost Checkpointing and Failure Recovery in Mobile Computing Systems
A mobile computing system consists of mobile and stationary nodes, connected to each other by a communication network. The presence of mobile nodes in the system places constraints on the permissible energy consumption and available communication bandwidth. To minimize the lost computation during recovery from node failures, periodic collection of a consistent snapshot of the system (checkpoint...
A Low Overhead Minimum Process Global Snapshot Collection Algorithm for Mobile Distributed System
Coordinated checkpointing is an effective fault-tolerance technique in distributed systems, as it avoids the domino effect and requires minimal storage. Most earlier coordinated checkpointing algorithms either block computation during checkpointing while forcing only a minimum number of processes to checkpoint, or are non-blocking but force all nodes to take checkpoints, even though many of them may not be necessary or non-...
A Distributed Consistent Global Checkpoint Algorithm for Distributed Mobile Systems
A distributed coordinated checkpointing algorithm for distributed mobile systems is presented. A consistent global checkpoint is a set of states in which no message is recorded as received in one process and as not yet sent in another process. It is used for rollback when process failure occurs. A consistent global checkpoint must be obtained for any checkpoint initiation by any process. This p...
A Nonblocking Coordinated Checkpointing Algorithm for Mobile Computing Systems
A checkpoint algorithm for mobile computing systems needs to handle many new issues like: mobility, low bandwidth of wireless channels, lack of stable storage on mobile nodes, disconnections, limited battery power and high failure rate of mobile nodes. These issues make traditional checkpointing techniques unsuitable for such environments. Minimum-process coordinated checkpointing is an attract...
Anti-message Logging Based Coordinated Checkpointing Protocol for Deterministic Mobile Computing Systems
A checkpoint algorithm for mobile computing systems needs to handle many new issues like: mobility, low bandwidth of wireless channels, lack of stable storage on mobile nodes, disconnections, limited battery power and high failure rate of mobile nodes. These issues make traditional checkpointing techniques unsuitable for such environments. Minimum-process coordinated checkpointing is an attract...